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Abstract. Weighted automata are nondeterministic automata with numerical weights 
on transitions. They can define quantitative languages L that assign to each word w a real 
number L(w). In the case of infinite words, the value of a run is naturally computed as 
the maximum, limsup, liminf, limit-average, or discounted-sum of the transition weights. 
The value of a word w is the supremum of the values of the runs over w. We study 
expressiveness and closure questions about these quantitative languages. 

We first show that the set of words with value greater than a threshold can be non- 
oj-regular for deterministic limit-average and discounted-sum automata, while this set is 
always cj-regular when the threshold is isolated (i.e., some neighborhood around the thresh- 
old contains no word). In the latter case, we prove that the cj-regular language is robust 
against small perturbations of the transition weights. 

We next consider automata with transition weights or 1 and show that they are as ex- 
pressive as general weighted automata in the limit-average case, but not in the discounted- 
sum case. 

Third, for quantitative languages L\ and L2, we consider the operations max(Li,L2), 
min(Li, L2), and 1 — L\, which generalize the boolean operations on languages, as well as 
the sum L\ +L2. We establish the closure properties of all classes of quantitative languages 
with respect to these four operations. 
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1. Introduction 

A boolean language L can be viewed as a function that assigns to each word w a boolean 
value, namely, L(w) = 1 if the word w belongs to the language, and L(w) = otherwise. 
Boolean languages model the computations of reactive programs. The verification problem 
"does the program A satisfy the specification B?" then reduces to the language-inclusion 
problem "is La Q Lb?", or equivalently, "is La(w) < Lb(w) for all words wT\ where 
La represents the behaviors of the program, and Lb contains all behaviors allowed by the 
specification. When boolean languages are defined by finite automata, this framework is 
called the automata-theoretic approach to model-checking |VW86j . 

In a natural generalization of this framework, a cost function assigns to each word a 
real number instead of a boolean value. For instance, the value of a word (or behavior) can 
be interpreted as the amount of some resource (e.g., memory consumption, or power con- 
sumption) that the program needs to produce it, and a specification may assign a maximal 
amount of available resource to each behavior, or bound the long-run average available use 
of the resource. 

Weighted automata over semirings (i.e., finite automata with transition weights in a 
semiring structure) have been used to define cost functions, called for mal po wer series for 



finite words [Sch61l lKS86] and w-series for infinite words [CK94} IDK031 lEK04| . In [CDH08], 



we study new classes of cost functions using operations over rational numbers that do not 
form a semiring. We call them quantitative languages. We set the value of a (finite or 
infinite) word w as the supremum value of all runs over w (if the automaton is nondeter- 
ministic, then there may be many runs over w), and the value of a run r is a function of the 
(finite or infinite) sequence of weights that appear along r. We consider several functions, 
such as Max and Sum of weights for finite runs, and Sup, LimSup, Limlnf, limit- average, and 
discounted sum of weights for infinite runs. For example, peak power consumption can be 
modeled as the maximum of a sequence of weights representing power usage; energy use 
can be modeled as the sum; average response time as the limit-average |CCH + 05J ICdHS03] . 



Quantitative languages can also be used to specify and verify reliability requirements: if a 
special symbol _L is used to denote failure and has weight 1, while the other symbols have 
weight 0, one can use a limit-average automaton to specify a bound on the rate of failure in 
the long run JCGH + 08| . The discounted sum can be used to specify that failures happening 
later are less important than those happening soon |dAHM03] . 

The quantitative language-inclusion problem "given two automata A and B, is La{w) < 
Lb(w) for all words wT can then be used to check, say, if for each behavior, the peak 
power used by the system lies below the bound given by the specification; or if for each 
behavior, the long-run average response time of the system lies below the specified average 
response requirements. In [CDH08 , we showed that the quantitative language-inclusion 
problem is PSPACE-complete for Sup-, LimSup-, and Limlnf-automata, while the decidabil- 
ity is unknown for (nondeterministic) limit-average and discounted-sum automata. We also 
compared the expressive power of the different classes of quantitative languages and showed 
that nondeterministic automata are strictly more expressive than deterministic automata 
in the limit-average and discounted-sum cases. 

In this paper, we investigate alternative ways of comparing the expressive power of 
weighted automata. First, we consider the cut-point languages of weighted automata, a 
notion borrowed from the theory of probabilistic automata [Rab63 . Given a threshold 
t)6l, the cut-point language of a quantitative language L is the set of all words w with 
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(b) Infinite words 



Table 1: Closure properties. The meaning of the acronyms is described on pj6l 



value L(w) > rj, thus a boolean language. We show that deterministic limit-average and 
discounted-sum automata can define cut-point languages that are not w-regular. Note that 
there also exist w-regular languages that cannot be expressed as a cut-point language of a 
limit-average or discounted-sum automaton [CDH08J. Then, we consider the special case 
where the threshold rj is isolated, meaning that there is no word with a value in the neigh- 
borhood of rj. We argue that isolated cut-point languages are robust, by showing that they 
remain unchanged under small perturbations of the transition weights. Furthermore, we 
show that every discounted-sum automaton with isolated cut-point defines an w-regular lan- 
guage, and the same holds for deterministic limit-average automata. This question is open 
for nondeterministic limit-average automata. Finally, we consider a boolean counterpart 
of limit-average and discounted-sum automata in which all transitions have weight or 1. 
Of special interest is a proof that for every limit-average automaton with rational weights 
in the interval [0, 1] there is an equivalent limit-average automaton with boolean weights. 
Therefore, the restriction to boolean weights does not change the class of quantitative lan- 
guages definable by limit-average automata; on the other hand, we show that it reduces the 
expressive power of discounted-sum automata. 

In the second part of this paper, we study the closure properties of quantitative lan- 
guages. It is natural and convenient to decompose a specification or a design into several 
components, and to apply composition operators to obtain a complete specification. We 
consider a natural generalization of the classical operations of union, intersection, and com- 
plement of boolean languages. We define the maximum, minimum, and sum of two quan- 
titative languages L\ and L^ as the quantitative language that assigns max(Li(w),L2(w)), 
min(Li(w) , L2(w)) , and L\{w) + L/2(w) to each word w. The complement L c of a quanti- 
tative language L is defined by L c (w) = 1 — L(w) for all words w\_\ The sum is a natural 
way of composing two automata if the weights represent costs (e.g., energy consumption). 
We give other examples in Section [2] to illustrate the composition operators and the use of 
quantitative languages as a specification framework. 

We give a complete picture of the closure properties of the various classes of quantitative 
languages (over finite and infinite words) under maximum, minimum, complement and sum 
(see Tabled]). For instance, (non) deterministic limit-average automata are not closed under 
sum and complement, while nondeterministic discounted-sum automata are closed under 
sum but not under complement. All other classes of weighted automata are closed under 



One can define L c (w) — k — L(w) for any rational constant k without changing the results of this paper. 
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sum. For infinite words, the closure properties of Sup-, LimSup-, and Limlnf-automata are 
obtained as a direct extension of the results for boolean finite automata, while for limit- 
average and discounted-sum automata, the proofs require the analysis of the structure of 
the automata cycles and properties of the solutions of polynomials with rational coefficients. 
Note that the quantitative language-inclusion problem "is La(w) < Lb(w) for all words wV 
reduces to closure under sum and complement, because it is equivalent to the question of 
the non-existence of a word w such that La{w) + L c B (w) > 1, an emptiness question which 
is decidable for all classes of quantitative languages [CDH08]. Also note that deterministic 
limit-average and discounted-sum automata are not closed under maximum, which implies 
that nondeterministic automata are strictly more expressive in these cases (because the 
maximum can be obtained by an initial nondeterministic choice). 

Related work. Functions such as limit-average (or mean-payoff) and discounted sum have 
been studied extensively in the branching-time context of game theory [Sha53[ [EM79, 
ICon92[ IZP961 ICdHS03] . It is therefore natural to use the same functions in the linear- 
time context of languages and automata. 

Weighted automata with discounted sum have been considered in [DR07], with multiple 
discount factors and a boolean acceptance condition (Muller or Buchi) ; they are shown to be 
equivalent to a weighted monadic second-order logic with discounting. Several other works 
have considered quantitative generalizations of languages, over finite words |DG07j . over 
trees [DKR08] , or using finite lattices |GC03] , but none of these works has addressed the 
expressiveness questions and closure properties for quantitative languages that are studied 
here. 

The lattice automata of [KL07] map finite words to values from a finite lattice. The 
lattice automata with Buchi condition are analogous to our LimSup automata, and their 
closure properties are established there. However, the other classes of quantitative automata 
(Sum, limit-average, discounted-sum) are not studied there as they cannot be defined using 
lattice operations and finite lattices. 

2. Quantitative Languages 

A quantitative language L over a finite alphabet £ is either a mapping L : S + — > R or 
a mapping L : S w — > R, where R is the set of real numbers. 

Weighted automata. A weighted automaton is a tuple A = (Q, <?/,£, 6, 7}, where 

• Q is a finite set of states, qj € Q is the initial state, and £ is a finite alphabet; 

• iCQxExQisa finite set of labelled transitions. We assume that 5 is total, i.e., for all 
q € Q and u£S, there exists q' such that (q, a, q') G 5; 

• 7 : 5 — )> Q is a weight function, where Q is the set of rational numbers. We assume that 
rational numbers are encoded as pairs of integers in binary. 

We say that A is deterministic if for all q G Q and a 6 S, there exists (q,a,q') € 8 for 
exactly one q' € Q. We sometimes call automata nondeterministic to emphasize that they 
are not necessarily deterministic. 

A run of A over a finite (resp. infinite) word w = o\a-i . . . is a finite (resp. in- 
finite) sequence r = goCiQiO"2 • • • of states and letters such that (i) qo = qi, and (ii) 
(c/j,<7j + i,(/j + i) E S for all < i < \w\. We denote by 7(7") = V0V1 . . . the sequence of 
weights that occur in r where v,- L = j(qi, o"i+i,gi-(-i) for all < i < \w\. 



u\. 
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Given a value function Val : Q + — V R (resp. Val : Q w — y R), we say that the Va 
automaton A defines the quantitative language La such that for all w G S + (resp. w € £' 

La{w) = sup{Val(7(r)) | r is a run of ^4 over iu}. 

We assume that Val(w) is bounded when the numbers in v are taken from a finite set (namely, 
the set of weights in A), and since weighted automata are total, every word has at least one 
run and thus La(w) is not infinite. 

We consider the following value functions to define quantitative languages (they all 
satisfy the boundedness assumption above) . Given a finite sequence v = v± . . . v n of rational 
numbers, define 
• Max(w) = max{wj | 1 < i < n}; 



Last(v) 



n 



LimAvg(-u) = liminf — • V^ 

n— >oo n *- — ' 
i=0 

Oc 

for < A < 1, Disc A (i;) = J^ A 



• Sum(w) = 2_] vf, 

Given an infinite sequence v = vqVi ... of rational numbers, define 

• Sup(-u) = sup{v n | n > 0}; 

• LimSup(i;) = limsup v n = lim sup{vi \ i > n}; 

n-^-oo n->oo 

• Liminf (v) = liminf v n = lim inf{wj | i > n}; 

n— >co n— >oo 

, n— 1 

=o 

oo 
j=0 

Intuitively for a sequence w = t>of i ... of rational numbers from the finite set V, the Sup 
function chooses the maximal number that appear in v; the LimSup function chooses the 
maximal number that appear infinitely often in v; the Liminf function chooses the minimal 
number that appear infinitely often in v; the LimAvg functions gives the long-run average 
of the numbers in v; and the DiscA gives the discounted sum of the numbers in v. Note 
that LimAvg(v) is defined using liminf and is therefore well-defined; all results of this paper 
hold also if the limit-average of v is defined instead as limsup^^^ - • YH=o v i- One could 
also consider the value function inf{t> n | n > 0} and obtain results analogous to the Sup 
value function. Note that the classical finite-word acceptance condition of finite automata 
(defining regular languages) can be encoded by Last-automata with weights in {0, 1}, while 
Biichi and coBiichi automata are special cases of respectively LimSup- and Limlnf-automata, 
with weights in {0, 1}. The class of languages defined by nondeterministic Biichi automata 
is called w-regular. 

Significance of value functions. The value functions provide natural generalizations of the 
classical boolean languages, they are complete for different levels of the Borel hierarchy, and 
they have been well studied in the context of game theory. 

(1) The Sup value function is the natural quantitative generalization of the reachability 
condition and is complete for the first level of the Borel hierarchy (Si complete). 
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(2) The LimSup and Limlnf objectives are the natural quantitative generalizations of the 
classical Biichi and coBiichi conditions. Moreover, the LimSup and Limlnf objectives 
are complete for the second level of the Borel hierarchy, and hence important and 
canonical quantitative functions (LimSup and Limlnf objectives are II2 and E2 complete, 
respectively) (see }Wad84[ [MP92J for details related to completeness and reducibility of 
objectives in the Borel hierarchy). 

(3) The LimAvg and Disc^ value functions have been studied in many different contexts in 
game theory. Discounted functions on graph games were introduced in the seminal work 
of Shapley |Sha53| . and have been extensively studied in economics. Discounted condi- 
tions have also been studied for discounting the future in systems theory [dAHM03j . The 
LimAvg function has also been studied extensively in the context of games on graphs: the 
works of Everett [Eve57j, Liggett-Lippman [LL69J, Hopfman-Karp [HK66], Ehrenfeucht- 
Mycielski |EM79| . Mertens-Neyman [MN81J . Zwick-Paterson [ZP96| have studied dif- 
ferent classes of games with LimAvg objective. Also see the books [FV97, P ut94] for 
applications of discounted and limit-average value functions in the context of games on 
graphs. Moreover, the LimAvg value function is complete for the third level of the Borel 
hierarchy (ri3-complete) [Cha07a]. 

Hence the value functions considered are classical, canonical, and well-studied in the bran- 
ching-time framework of games on graphs, and we study them in the linear-time framework 
of weighted automata. 

Notation. Classes of weighted automata over infinite words are denoted with acronyms 
of the form xy where x is either N(ondeterministic), D(eterministic), or N /b (when deter- 
ministic and nondeterministic automata have the same expressiveness) , and y is one of the 
following: Sup, LsuP(LimSup), LlNF(Limlnf), LAVG(LimAvg), or Disc. For Biichi and 
coBiichi condition, we use BW and CW respectively. 

Reducibility. A class C of weighted automata is reducible to a class C of weighted automata 
if for every A G C there exists A' G C such that La = La>, i.e., La(w) = La>(w) for all (finite 
or infinite) words w. In particular, a class of weighted automata can be determinized if it is 
reducible to its deterministic counterpart. Reducibility relationships for (non) deterministic 
weighted automata are given in [CDH08J. 

Composition. Given two quantitative languages L and V over E, and a rational number 
c, we denote by max(L, L') (resp. min(L, L'), L + L' , c + L, and cL) the quantitative 
language that assigns max{L(u;), L'(w)} (resp. mm{L(w),L'(w)}, L(w) + L'(w), c + L(w), 
and c • L{w)) to each word w € E + (or w S E w ). We say that c + L is the shift by c of L and 
that cL is the scale by c of L. The language 1 — L is called the complement of L. The max, 
min and complement operators for quantitative languages generalize respectively the union, 
intersection and complement operator for boolean languages. For instance, De Morgan's 
laws hold (the complement of the max of two languages is the min of their complement, 
etc.) and complementing twice leave languages unchanged. 
Example 1. We consider a simple illustration of the use of limit- average automata to model 



the energy consumption of a motor. The automaton B in Figure 1(b) specifies the maximal 
energy consumption to maintain the motor on or off, and the maximal consumption for a 
mode change. The specification abstracts away that a mode change can occur smoothly 



with the slow command. A refined specification A is given in Figure 1(a) where the effect 
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(slow) 



slow, 1 

(a) Limit-average automaton A. 



off, 



on, 2 




off, slow, 10 



(b) Limit-average automaton B. 



Figure 1: Specifications for the energy consumption of a motor: A refines B, i.e., La < Lb- 

of slowing down is captured by a third state. One can check that La(w) < Lb(w) for all 
words w £ {on, off , slow}^ . Given two limit-average automata that model the energy con- 
sumption of two different motors, one needs to define composition operations for weighted 
automata to obtain the maximal, minimal, and sum of the average energy consumption of 
the motors. 

Example 2. Consider an investment of 100 dollars that can be made in two banks A\ 
and Ai as follows: (a) 100 dollars to bank A\, (b) 100 dollars to bank A2, or (c) 50 dollars 
to bank Ai and 50 dollars to bank A%- The banks can be either in a good state (denoted 
Gil, G<i) or in a bad state (denoted B\, B2). If it is in a good state, then A\ offers 8% 
reward while A2 offers 6% reward. If it is in a bad state, then A\ offers 2% reward while A^ 
offers 4% reward. The change of state is triggered by the input symbols b%, 62 (from a good 
to a bad state) and g\ , 52 (from a bad to a good state) . The rewards received earlier weight 
more than rewards received later due to inflation represented by the discount factor. The 
automata A\ and A2 in Figure [2] specify the behavior of the two banks for an investment 
of 100 dollars, where the input alphabet is {51, 61} x {52,^2} (where the notation (51, •) 
represents the two letters (51,52) and (51,62)) and similarly for the other symbols). If 
50 dollars are invested in each bank, then we obtain automata C\ and C2 from A\ and A2 
where each reward is halved. The combined automaton is obtained as the composition of C\ 
and C2 under the sum operator. 



3. Expressiveness Results 

The expressive power of weighted automata can be compared by mean of the reducibility 
relation, saying that a class C of weighted automata is at least as expressive as a class C 
if every quantitative language definable by some automaton in C' is also definable by some 
automaton in C. The comparison includes boolean languages, considering them as a special 
case of quantitative languages of the form L : £^ — > {0, 1}. It was shown in [CDH08] that 
a wide variety of classes of quantitative languages can be defined by the different types of 
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(■,92), 6 




(gi,-), 8 (-,52), 6 

(a) 100 dollars invested in bank A±. (b) 100 dollars invested in bank A^. 

Figure 2: The discounted-sum automaton models of two banks. 

weighted automata, depending on the value function and whether they are deterministic 
or not. This contrasts with the situation for boolean languages where most of the classes 
of automata define cj-regular languages. In this section, we investigate alternative ways 
of comparing the expressive power of weighted automata and of classical finite automata. 
First, we use the cut-point languages of weighted automata to compare with the class of 
w-regular languages, and then we use weighted automata with boolean weights, i.e. all 
transitions have weight or 1, to compare with general weighted automata. 



3.1. Cut-point languages. Let L be a quantitative language over infinite words and let 
T) £ R be a threshold. The cut-point language defined by (L, rj) is the (boolean) language 

lF> = {w£E w L{w) > r]}. 

Cut-point languages for finite words are defined analogously. They have been first defined for 
probabilistic automata [Rab63], then generalized to inverse image recognition for semiring 
automata over finite words (see e.g. [KS86, CM00]). It is easy to see that the cut-point 
languages of Max- and Last-automata are regular (they have the same acceptance condition 
as finite automata), those of Sum-automata are context-free (using a stack to simulate 
accumulated weights), and those of Sup-, LimSup-, and Limlnf-automata are w-regular (they 
have the same acceptance condition as Biichi and coBiichi automata). 

We show that the classes of cut-point languages definable by (non) deterministic limit- 
average and discounted-sum automata are incomparable with the w-regular languages. 
One direction of this result follows from Theorem 13.11 and the other direction follows 
from [CDH08, Theorems 13 and 14] where w-regular languages are given that are not defin- 
able as cut-point language of nondeterministic limit-average and discounted-sum automata. 

Theorem 3.1. There exist deterministic limit-average and discounted-sum automata whose 
cut-point language is not uo-regular. 

Proof. Consider the alphabet S = {a, b}, and consider the languages L\ that assigns to each 
word its long-run average number of a's, and L2 that assigns the discounted sum of a's. Note 
that L% is definable by a deterministic limit-average automaton, and L2 by a deterministic 
discounted-sum automaton. It was shown in [Cha07bj that the cut-point language Lf is 
complete for the third level of the Borel hierarchy, and therefore is not w-regular. We show 
that L2 is not w-regular. 

Given a finite word w € £*, let v a (w) 



w. 



We say that w is ambiguous if 1 



1- 



= T-\ 



a X i l be the discounted sum of a's in 
< 1. The ambiguity lies in that some 



continuations of w (namely w.a w ) are in L^ and some are not (namely w.b u ). We show 
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that for all A > ^ , if w is ambiguous, then either w.a or w.b is ambiguous, which entails that 
there exists an infinite word w all of whose finite prefixes are ambiguous (and L2(w) = 1). 
To do this, assume that 1 — j^ < v a (w) < 1, and show that either 1 — \7 < v a (w.a) < 1 

or 1 y^jt < v a(w.b) < 1. Since v a (w.a) = v a (w) + \' w < and v a (w.b) = v a (w), we 

have to show that 1 - ^ < v a (w) < 1 - A^l or f - ^^- < v a (w) < 1. This holds if 

1 ^x~ < 1 — AJ > which is equivalent to A > ^. 

Now, we show that if there exists a nondeterministic Biichi automaton A for L^ , then 
the set of states S n reached in A by reading the first n letters of w (which we denote by 
w)[i... n i) should be different for each n, i.e., n / m implies S n ^ S m . Towards a contradiction, 
assume that S n = S m for n < m. Then for all continuations w' £ S w , we have Wu___ n yw' € 

L^ if and only if Wu„w\ - w ' e -^2" (*)■ 

In particular, consider the continuations u>[ n +i.„] and u)r m +i...i, and for each i > 1, 
let 7* = fa(w[i...j]) and K l = L2(^[«+i...])- Then, we have 7* + A* • K % = 1, and thus 
7 m + A m • K n < 1 iff K n < K m . Since either K n < K m or ^T m < K n , we have ei- 
ther L2(wu m ].w\ n +ii) < 1 or -^2(^[i...n]-^[m+i...]) < 1- By (*)> this implies that either 
-^2(f^[i...m]-^[n+i...]) = Ij or ^2(^[i...n]-' l ^[m+l...]) = 1) an d in both cases since Li2(w) = 1, we 
get 

1 _ y 1 l - 7™ 

A m ~ A 71 
This implies A m " n (l - P(A)) = 1 - Q(A) where P(A) = u a («>[i„.n]) and Q(A) = Uo(t&[i...m]) 
are polynomials of respective degree n — 1 and m — 1, and with coefficients in the set {0, 1}. 
First, observe that the equation is not identically because the coefficient of the term of 
degree is not (as the first letter of w must be b since a is not ambiguous). Second, every 
coefficient in the equation is in the set {—2, —1, 0, 1, 2}, and a classical result shows that if 
- is a solution of a polynomial equation with p and q mutually prime, then p divides the 
coefficient of degree 0, and q divides the coefficient of highest degree. Therefore, no rational 
number in the interval ]^, 1[ can be a solution. This shows that n ^ m implies S n ^ S m , 
and thus the automaton A cannot have finitely many states. □ 

We note that cut-point languages are not stable under arbitrarily small perturbations of 
the transition weights, nor of the value of the cut-point. Consider the quantitative languages 
L\, L2 from the proof of Theorem 13.11 If for instance a limit-average automaton A assigns 
weight 1 + e to the a's and to the 6's, its cut-point language L^ is clearly different from 
Lf , no matter the value of e > 0. The same holds with respect to L2 if A is interpreted as 
a discounted-sum automaton. 

In the theory of probabilistic automata, where finite words are assigned a probability 
of acceptance, the cut-point languages may also be non-regular. Therefore, one considers 
the special case where the cut-point is isolated, and shows that the cut-point languages are 
then regular [Rab63]. 

A number 77 is an isolated cut-point of a quantitative language L if there exists e > 
such that 

\L(w) - 77 1 > e for all w € S w . 

We argue that isolated cut-point languages are robust, in that they remain unchanged 
under small perturbations of the transition weights. This follows from a more general result 
about the robustness of weighted automata. 
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A class of weighted automata is robust if a small (syntactical) perturbation in the 
weights of an automaton induces only a small (semantical) perturbation in the values of 
the words in the quantitative language of the automaton, and the semantical perturbation 
tends to when the syntactical perturbation tends to 0. To formally define robustness, we 
need e-approximations of automata, and distance between quantitative languages. 

Let A = (Q,qi,'E,S,'y} be a (nondeterministic) weighted automaton, and let e £ R-°. 
We say that a weighted automaton B = (Q' , q'j, E, 5' , 7') is an e- approximation of A if 

• Q' = Q, q'j = qi, 5' = 5, and 

• W{q,o-,q') -n?{q,(T,q')\ < e for all (q,a,q') G 5. 

The sup-distance between two quantitative languages L\,L 2 : E w — >■ IR is defined by 

D snp (Li,L 2 ) = sup \Li(w) - L 2 (w)\. 

We say that a class C of weighted automata is uniformly robust if for all r\ G R >0 , there 
exists e G M >0 such that for all automata A,B G C such that B is an e-approximation of 
A, we have D sup (La,Lb) < n. Note that uniform robustness implies a weaker notion of 
robustness where a class C of weighted automata is called robust if for all automata A G C 
and for all r\ G M >0 , there exists e G M >0 such that for all e-approximations B of A (with 
B G C), we have D sup (La,Lb) < 77 (here the value of e can depend for instance on the 
weights of the automaton ^4) . 

Theorem 3.2. The classes of (non) deterministic Sup-, LimSup-, Limlnf-, LimAvg- and 

Disc-automata are uniformly robust. 

Proof. Let A, B be two weighted automata with B an e-approximation of A. It is easy to 
see that for Sup-, LimSup-, Limlnf- and LimAvg-automata, the value of a run r of B differs 
by at most e from the value of the same run in A. Therefore, D sup (La, Lb) < e and we can 
take e = n. For Disc-automata, the value of a run of B differs by at most j^ from the value 
of the same run in A, where A is the discount factor. Therefore, we can take e = 7/(1 — A). □ 

As a corollary of Theorem 13.21 for an isolated cut-point r), the cut-point language L- v 
remains unchanged under small perturbations of the weights. 

Corollary 3.3. Let L^ be the quantitative language defined by a weighted automaton A, 
and let rj be an isolated cut-point of La- There exists a rational e > such that for all e- 
approximations B of A, we have L~^ = Ljp (where Lb is the quantitative language defined 
byB). 

Now, we show that the isolated cut-point languages of deterministic discounted-sum and 
limit- average automata are w-regular. For nondeterministic automata, the same property 
holds in the discounted-sum case, but the question is open for limit- average. 

Theorem 3.4. Let L be the quantitative language defined by a Disc- automaton. If rj is an 
isolated cut-point of L, then the cut-point language L- v is uo-regular. 

Proof. Let A be the discount factor of the Disc-automaton A that defines L. Since n is an 
isolated cut-point of L, let e > such that \L(w) — 77 1 > e for all w G E w . Let n G N such 
that u n = \-^\ < e where V = m&x.r qjCrt q^ e g A \j(q, a, q')\ is the largest weight in A. Note 
that u n is a bound on the difference between the A-discounted sum of the weights in any 
infinite run f of A and the A-discounted sum of the weights in the prefix of length n of f, 
and u n — > when n — > 00. 
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Consider an arbitrary run r in A of length n, and let j(r) be the A-discounted sum of the 
weights along r. Then, it should be clear that 7(7-) [77 — e + u n ,r] + e — u n ], because 
otherwise, the value of any (infinite) continuation of r would lie in the interval [rj — e, rj + e], 
which would be a contradiction to the fact that 77 is an isolated cut-point of L. Moreover, if 
7(7") < rj — e+Un, then any (infinite) continuation of r has value less than rj — e + 2u n < rj + e, 
and therefore less than 77, while if 7(7") > rj + e — u n , then any (infinite) continuation of 
r has value greater than 77. Therefore, the cut-point language L- v can be defined by the 
unfolding up to length n of the Disc-automaton that defines L, in which the states that are 
reached via a path with value at least 77 + e — u n are declared to be accepting (for Biichi 
condition), and have a self- loop on E. □ 

Theorem 3.5. Let L be the quantitative language defined by a deterministic LimAvg- 
automaton. If rj is an isolated cut-point of L, then the cut-point language L- v is u-regular. 

Proof. Let Abe & deterministic LimAvg-automaton, defining the language L. Consider the 
SCC-decomposition C±, C2, ■ ■ ■ , C^ of the underlying graph of A. For each 1 < i < k, let 
nii and Mi be the minimal and maximal average weight of a cycle in Cj (those values can 
be computed with Karp's algorithm [Kar78| ) . It is easy to see that for every 1 < % < k, for 
every v £ [mi, Mi], there exists a word w £ E w such that L{w) = v. Therefore, since 7/ is 
an isolated cut-point of L, we have rj [mi, Mi] for all 1 < % < k. A deterministic Biichi 
automaton (DBW) for L-* 1 is obtained from A by declaring to be accepting all states q of 
A such that q £ Cj and ttt-j > rj. □ 

3.2. Boolean 'weights. We consider weighted automata with boolean set of weights, i.e. 
all transitions have weight or 1. The aim is to have a boolean counterpart to limit- 
average and discounted-sum automata, and compare the expressive power. We show that 
the restriction does not change the class of quantitative languages definable by limit-average 
automata, but does reduce the expressive power of discounted-sum automata. 

Theorem 3.6. The class of nondeterministic (resp., deterministic) LimAvg- automata with 
rational weights in [0,1] is reducible to the class of nondeterministic (resp., deterministic) 
LimAvg- automata with weights and 1 only. 

Proof. Given a NLavg A = (Q, qj, S, 5, 7) with weights in [0, 1], we construct a NLavg B 
with weights in {0, 1} such that La = Lb- 

First, let W = {"fiq, o, q') | (q, a, q') € 5} be the set of weights that occur in A, and let 
77,4 be the smallest integer n such that for all v £ W, there exists p £ N such that v = - (i.e., 
— is the greatest common divisor of the weights of A). We define B = (Q', q\, E, 5' , 7') as 
follows: 

• Q' = Q x [tia] (where [ua] denotes the set {0,1, ... ,ua~ 1}). Intuitively, when we reach 
a state (q, i) in B, it means that the state q was reachable in A and that the sum of the 
weights to reach q is of the form k + — for some integer k. In B however, the sum of the 
weights to reach (q, i) will then be k, and we store in the discrete state the information 
that the remainder weight is — . Whenever this remainder exceeds 1, we introduce a 
weight 1 and decrement the remainder. 

• q'j = (qi,0); 

• for each transition (q,o~,q') £ 5 and each value i £ [ha], the following transitions are in 
5' (where v = 7(5, a, q')): 
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— ((g, i),cr, (q' ,j)) for j = i + v ■ ua if ^- + v < 1; the weight of such a transition is in 

— ((q, i),cr, (q' ,j)) for j = i + (v — 1) • ua if — — h u > 1; the weight of such a transition 
is 1 in 7'. 

Note that in the above, v ■ ua is an integer and j G [n^]. 

There is a straightforward correspondence between the runs in A and the runs in B. 
Moreover, if the average weight of a prefix of length n of a run in A is — , then the average 
weight of the prefix of length n of the corresponding run in B is between — and =^—. Hence 
the difference tends to when n —> 00. Therefore, the value of a run in A is the same as 
the value of the corresponding run in B, and therefore La = Lb- 

Finally, note that if A is deterministic, then B is deterministic. □ 

Theorem 3.7. The class of deterministic Disc-automata with rational weights in [0, 1] is 
not reducible to the class of (even nondeterministic) Disc-automata with weights and 1 
only. 

Proof. Given a discount factor < A < 1, consider the DDisc over £ = {a, b) that consists 
of a single state with a self- loop over a with weight ^- and a self- loop over b with weight 0. 
Let L\ be the quantitative language defined by this automaton. Towards a contradiction, 
assume that this language is defined by a NDisc A with weights in {0, 1}. First, consider 
the word ab^ whose value in L\ is — ^— < 1. This entails that A cannot have a transition 
from the initial state over a with weight 1 (as this would imply that La{o,V*') > 1). Now, 
the maximal value that La can assign to the word a w is A + A 2 + A 3 + • • • = Y-\ which is 
strictly smaller than L\(a w ) = 2 ti-\) • This shows that A cannot exist. □ 

4. Closure Properties 

We study the closure properties of weighted automata with respect to max, min, com- 
plement and sum. We say that a class C of weighted automata is closed under a binary 
operator op(-, •) (resp. a unary operator op'(-)) if for all Ai,A 2 € C, there exists A± 2 € C 
such that La 12 = op(La 1 ,La 2 ) (resp. La 12 = op'(Ivii))- All closure properties that we 
present in this paper are constructive: when C is closed under an operator, we can always 
construct the automaton A12 € C given Ai,A 2 € C. We say that the cost of the closure 
property of C under a binary operator op is at most 0(f(ni,mi,n 2 ,m 2 )) if for all automata 
A\,A 2 G C with ni states and nii transitions (for i = 1,2 respectively), the constructed 
automaton A± 2 € C such that La 12 = op(La 1 ,La 2 ) has at most 0(f(ni,mi,n 2 ,m 2 )) many 
states. Analogously, the cost of the closure property of C under a unary operator op' is at 
most 0(f(n, to)) if for all automata A\dC with n states and to transitions, the constructed 
automaton A\ 2 G C such that La 12 = op'(-L^ 1 ) has at most 0(f(n,m)) many states. For 
all reductions presented, the size of the largest weight in Ai 2 is linear in the size p of the 
largest weight in A±,A 2 (however, the time needed to compute the weights is quadratic in 
p, as we need addition, multiplication, or comparison, which are quadratic in p). 

Notice that every class of weighted automata is closed under shift by c and under scale 
by \c\ for all c £ Q. For Sum-automata and discounted-sum automata, we can define the 
shift by c by making a copy of the initial states and adding c to the weights of all its 
outgoing transitions. For the other automata, it suffices to add c to (resp. multiply by |c|) 
all weights of an automaton to obtain the automaton for the shift by c (resp. scale by |c|) of 
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its language. Therefore, all closure properties also hold if the complement of a quantitative 
language L was defined as k — L for any constant k. 

Our purpose is the study of quantitative languages over infinite words. For the sake of 
completeness we first give an overview of the closure properties for finite words. Table [Ha) 
summarizes the closure properties for finite words, and Table QJb) for infinite words. 

4.1. Closure properties for finite words. For finite words, we consider closure under 
max, min, complement, and sum for Max-, Last- and Sum-automata. 

Theorem 4.1. Deterministic Max- and Last-automata are closed under max, with cost 
0{n\ ■ n2). Nondeterministic Max-, Last- and Sum-automata are closed under max, with 
cost 0(m + ri2). Deterministic Sum-automata are not closed under max. 

Proof. For the nondeterministic automata, the result follows from the fact that the max 
operator can be obtained by an initial nondeterministic choice between two quantitative 
automata. For deterministic Max- and Last-automata, the result is obtained using a stan- 
dard synchronized product construction, where the weight of a transition in the product is 
the maximum of the corresponding transition weights in the two automata. Finally, deter- 
ministic Sum-automata are not closed under the max operator because the language over 
X = {a, b} that assigns to each finite word w € S + the number ma,x{L a (w),Lb(w)} where 
L a [w) is the number of occurrences of a in w (for a = a,b) is definable by the max of 
two deterministic-Sum languages, but not by a deterministic Sum-automaton (Theorem 2 
in [CDHOSp . □ 

Theorem 4.2. Deterministic and nondeterministic Max-automata are closed under min, 
with cost 0(n\ ■ m\ ■ n<i ■ 1712). Deterministic and nondeterministic Last-automata are closed 
under min, with cost 0(n\ -r^). Deterministic and nondeterministic Sum-automata are not 
closed under min. 

Proof. Given two Last-automata A± and A2 (over the same alphabet), we use the classi- 
cal synchronized product A12 = A\ x A2, where the weight of a transition in A12 is the 
minimum of the corresponding transition weights in Ai and A2- It is easy to see that 
La 12 = niin(.L ( 4 1 ,.L ( 4 3 ). If A\ and A2 are deterministic, then so is A\i. 

The construction for Max- automata is the same as for Sup-automata over infinite words 
given in the proof of Theorem 14.71 

Finally, for Sum-automata, consider the language L m over E = {a, 6} that assigns 
to each finite word w € S + the value min{L a (w), L^w)} where L a {w) is the number of 
occurrences of a in w (for a = a, b). We claim that L m is not definable by a nondeterministic 
Sum-automaton. Indeed, assume that the Sum-automaton A with state space Q defines L m . 
First, the sum of weights in every reachable cycle of A over a's must be at most 0. Otherwise, 
we can reach the cycle with a finite word w\ and obtain an arbitrarily large value for the 
word w\a l for i sufficiently large, while for any such i the value of w\a l is the number of 
6's in w\ which is independent of i. Analogously, the sum of weights in every reachable 
cycle of A over fe's must be at most 0. Now, let j3 = max eG( 5|7(e)| be the maximal weight 
in A, and consider the word w = a n b n for n > 2/3 ■ \Q\. Every run of A over a n (or over b n ) 
can be decomposed in possibly nested cycles (since A is nondeterministic) and a remaining 
non-cyclic path of length at most \Q\. Hence, the value of any run over w is at most 2j3-\Q\. 
However, the value of w should be n, thus A cannot exist. O 
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Theorem 4.3. Deterministic Last- and Sum-automata are closed under complement, with 
cost 0(n). Nondeterministic Last-automata are closed under complement, with cost 0(2 n ). 
Nondeterministic Sum automata, and both deterministic and nondeterministic Max-auto- 
mata are not closed under complement. 

Proof. To define the complement of the language of a deterministic Sum- (or Last-) automa- 
ton, it suffices to multiply all the weights by —1, and then shift the language by 1. For the 
class of nondeterministic Last-automata, the result follows from the fact that it is reducible 
to its deterministic counterpart. 

The negative result for Max- automat a follows from an analogous in the boolean case 
(consider the language L over {a, b} such that L(a t ) = for all i > 1, and L(w) = 1 for all 
words containing the letter b) . Finally, according to the proof of Theorem 14.21 the language 
min(L a , L^) where L a {w) is the number of occurrences of a in w (for a = a,b) is not definable 
by a nondeterministic Sum-automaton. Since min(L a ,L(,) = 1 — max(l — L a , 1 — Lj,) and (i) 
1 — L a and 1 — Lf, are definable by Sum-automata, and (ii) nondeterministic Sum-automata 
are closed under max (Theorem 14. ip . the language max(l — L a , 1 — Lf,) is definable by a 
nondeterministic Sum-automaton, but not its complement and the result follows. |_J 

Theorem 4.4. Every class of weighted automata over finite words is closed under sum. The 
cost is 0{n\ ■ n 2 ) for Last- and Sum-automata, and 0(n\ ■ mi ■ n2 ■ m<2) for Max-automata. 

Proof. It is easy to see that the synchronized product of two Last-automata (resp. Sum- 
automata) defines the sum of their languages if the weight of a joint transition is defined 
as the sum of the weights of the corresponding transitions in the two Last-automata (resp. 
Sum-automata). 

We give the construction for two Max- automat a A\ = (Q\,q\, S,#i,7i) and A2 = 
{Q2,q], S, ^2,72)- We construct a Max-automaton A12 = (Q, (//, £,5, 7) such that L*a 12 = 
La x + La 2 - Let Vi = {71(e) | e G 5i} be the set of weights that appear in Ai (for i = 1, 2), 
and define: 

• Q = Q\ X V\ X Qi X V%. Intuitively, we remember in a state (qi,vi,q2,V2) the largest 
weights v\ , V2 seen so far in the corresponding runs of A\ and A2 ; 

• Qi = (<?/, w min' ih ^min) where v l min is the minimal weight in Vi (for * = 1,2); 

• For each a £ S, the set 5 contains all the triples {(qi, Vi, qi-, V2),o~, (q[, v[, g 2 ,u 2 )) such 
that Vi G Vi, (qi, a, q[) G Si, and v[ = maxjuj, yfa, a, q'A}, for i = 1, 2; 

• 7 is defined by 

l(((qi,vi,q2,V2),o-,(q' 1 ,v' 1 ,q 2 ,v! 2 ))) = v' 1 +v' 2 
for each ((qi,v 1 ,q 2 ,v 2 ),o-,(q' 1 ,v' 1 ,q' 2 ,v 2 )) eS. 
If A\ and A2 are deterministic, then A12 is deterministic. The result for deterministic Max- 
automata follows. □ 

4.2. Closure under max for infinite words. The maximum of two quantitative lan- 
guages defined by nondeterministic automata can be obtained by an initial nondeterministic 
choice between the two automata. This observation was also made in [DR07J for discounted- 
sum automata. For deterministic automata, a synchronized product can be used for Sup and 
LimSup, while for Limlnf we use the fact that NLinf is determinizable with an exponential 
blow-up [CDH08] . 
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Theorem 4.5. The nondeterministic Sup- ; LimSup-, Limlnf-, LimAvg- and Disc-automata 
are closed under max, with cost 0{n\ + n<i), the deterministic Sup- and LimSup- automata 
with cost 0{n\ ■ n'l), the deterministic Limlnf -automata with cost 0{{m\ + mj)™ 1 " 1 "™ 2 ). 

Sketch. For all the nondeterministic quantitative automata, the result follows from the fact 
that the max operator can be achieved with an initial nondeterministic choice between two 
weighted automata. For DLinf, the result follows from the reducibility of NLinf to DLlNF 
with an exponential blow-up JCDH08J . We now prove that DLsup and DSup are closed 
under max with cost 0(n\ ■ 712). Given two DLsup (or DSup) A\ and A2 over the same 
alphabet, we construct the usual synchronized product A12 = A\ x _A 2) where the weight of 
a transition in A\2 is the maximum of the corresponding transition weights in A\ and A2 ■ 
It is easy to see that La 12 = max(L J 4 1 , La 2 ) in both cases. □ 

Theorem 4.6. The deterministic LimAvg- and Disc-automata are not closed under max. 

Proof. The fact that DDisc is not closed under max follows from the proof of Theorem 16 
in [CDH08J . where it is shown that the quantitative language max(Li, L%) cannot be defined 
by a DDisc, where L\ (resp. L2) is the language defined by the DDisc that assigns weight 
1 (resp. 0) to a's and weight (resp. 1) to 6's. 

We now show that DLavg is not closed under max. Consider the alphabet £ = 
{a, 0} and the quantitative languages L a and L\, that assign the value of long-run average 
number of a's and 6's, respectively. There exists DLavg for L a and Lf,. We show that 
L m = max(L a ,Lfe) cannot be expressed by a DLavg. By contradiction, assume that A is 
a DLavg with set of states Q that defines L m . Consider any reachable cycle C over a's in 
A. The sum of the weights of the cycle must be its length \C\, as if we consider the word 
w* = wc ■ (a' ') w where wc is a finite word whose run reaches C, the value of w* in L m is 
1. It follows that the sum of the weights of the cycle C must be \C\. Hence, the sum of the 
weights of all the reachable cycles C over a's in A is \C\. 

Consider the infinite word w^ = (a^ • & 2 I < 2I) W , and let Wj = (a™ ■ b 2 ^y . Since 
L m (w 00 ) = |, the run of A over w^ has value |. It follows that for all e > 0, there is an 
integer j £ , such that for all j > j £ , we have 

\wj\ 3 

where j(wj) is the sum of the weights of the run of A over Wj. Consider a word w^ 
constructed as follows. We start with the empty word wq and the initial state oq of A, and 
for all j > 0, we construct (wj+i,qj+i) from (v}j,qj) as follows: the state g J+ i is the last 
state of the run of A from qj over a'™ • b 2 '®'. This run has to contain a cycle Cj+i over a's. 
We set Wj+i = Wj • a'^' + ' Cj+1 ' ■ b 2 '®'. Observe that for all j > 1, the run of A over w^ in the 
segment between Wj and Wj+i is identical to the run from qj to a J+ i up to the repetition of 
the cycle Cj+i once more. The word Woq is the limit of this construction (wj is a prefix of 
Woo for all j > 0). Let ctj = YH=i\Ci\- Since 1 < |Cj| < \Q\ we have j < a,j < j ■ \Q\. Hence 
we have the following equality: , ^ = 7 , ,, aj . Hence for all e > 0, there exists j e such 

\Wj\ \Wj\+aj 
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that for all j > j £ we have 



7(%) . §■ \wj\ - e • \wj\ + atj 



> 



\w j \+a j 



2 1 ou 

> - -e-\ 1 — r 1 

3 3 \wj\ + a,- 

2 1 7 



3 3 j-(3\Q\ + \Q\) 

" 3~ £+ 12\Ql 

This shows that lim infj_ s . 00 |JV > § + 12 }n\ an d thus we have La{w oo) > § + 12 }oi • 
Since 1 < \C{\ < |Q| for all i > 1, we have L m (u; 00 ) < g which is a contradiction. □ 

4.3. Closure under min for infinite words. The positive results about closure properties 
under min for quantitative languages generalize the closure properties of boolean languages 
under intersection. The constructions are straightforward extensions of the standard con- 
structions for finite, Biichi, and coBiichi automata (see e.g. |Var96] ) . 

Theorem 4.7. The (non) deterministic Sup-automata are closed under min, with cost 0{n\- 
mi ■ n 2 ■ 1712), 

Proof. The construction in the proof of Theorem 14.41 can be adapted by defining the weight 
l({(qi, vi, q 2 , v 2 ), o-,(q[, v[, q 2 ,v' 2 ))) asmin{v[,v 2 } ior each ((qi,vi,q 2 ,v 2 ), a, (q[,v[,q 2 ,v' 2 )) G 
6. □ 

Theorem 4.8. The deterministic LimSup- automata are closed under min with cost 0{n\ ■ 

n 2 -2 mi+m2 ). 

Proof. Let A± = {Qi,q},Yi, #1,71) and A 2 = (Q 2 ,qj,'^,5 2 ,^ 2 } be two DLsup. We construct 
a DLsup A = (Q,qi, S, 6, 7) such that La = minjL^j, La 2 }- Let Vi = {7i(e) | e G 5i} be 
the set of weights that occur in A4 (for i = 1,2). For each weight v & V\ U V 2 = {v\ ,... ,v n }, 
we construct a DBW A\ 2 that consists of a copy of A\ and a copy of A 2 . We switch from 
one copy to the other whenever an edge with weight at least v is crossed. All such switching 
edges are accepting in A\ 2 (i.e., they have weight 1 while all other edges have weight 0). The 
automaton A then consists of the synchronized product of these DBW, where the weight 
of a joint edge is the largest weight v for which the underlying edge in A\ 2 is accepting. 
Formally, let 

• Q = Qi x Q 2 x {l,2} m where m = \Vi U V 2 \ (and assume V\ U V 2 = {v\, . . . , v m }); 

• Qi = (q}> ?j > &ii • • • j b m ) where bi = 1 for all 1 < z < m; 

• 5 contains all the triples ((gi, q 2 , b±, . . . , b m ),a, (g^, q' 2 , 6' l5 . . . , b' m )) such that <7 € £ and 

- (fftjO - ,^) G ^ for i = 1,2; 

— for all 1 < j < m, we have b'- = 3 — bj if 7^. (g&,. , a, q' b . ) > Uj , and 6' = 6j otherwise. 

• 7 assigns to each transition ({qi,q 2 , 61, ... , b m ),a, {q[, q' 2 , b^, . . . , 6^)) G <5 the weight u = 
max({f m i n } U {fj [ bj ^ 6'}) where v m \ n is the minimal weight in V% U V2. D 
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Theorem 4.9. The (non) deterministic Li m I nf -automata are closed under min with cost 
0{n\ ■ n<i), and the nondeterministic LimSup- automata with cost 0(n\ ■ ri2 ■ (mi + 1712))- 

Proof. Let A\ = ((5i,g), £,<5i,7i) and Ai = {Q2,QJ, S, 82, 72) be two NLsup. We construct 
a NLsup A = (Q,qi, £,<5, 7) such that La = minjL^^L^}. Let Vi = J7i(e) j e G Jj} be 
the set of weights that appear in Ai (for i = 1,2). Let V\ U V2 = {f 1, • • • , v n } and define 

• Q = {qi} U Qi x Q 2 x {1, 2} x (Vi U V2) (where qi Qi U Q2 is a new state). Initially, 
a guess is made of the value v of the input word. Then, we check that both A± and A2 
visit a weight at least v infinitely often. In a state (qi,q2,j, v) of ^4, the guess is stored in 
v (and will never change along a run) and the value of the index j is toggled to 3 — j as 
soon as Aj does visit a weight at least v; 

• For each a £S, the set <5 contains all the triples 

— (qi, a, (qi,q2, 1, ^)) such that v G Vi U V2 and for all i G {1, 2}, we have (g}, a, qi) € <5j. 

- ((qi,Q2,j,v),a, (q[,q2,j',v'}) such that t/ = v, (qi,a,q[) e Si (i = 1,2), and / = 3- j 
if jj(qj,a, q'A > v, and j' = j otherwise. 

• 7 is defined by j{qi,a,(q 1 ,q 2 ,l,v)) = and j((qi,q2,j,v),a,(q' 1 ,q 2 ,j',v'}) is v if j ^ j' 
and v m [ n otherwise, where v m \ n is the minimal weight in V\ U V^- 

For the case of Limlnf-automata A±,A2, we can use the synchronized product ^4i2 = A1XA2, 
where the weight of a joint transition in A12 is the minimum of the corresponding transition 
weights in Ai and Ai- It is easy to see that La 12 = rn i n (-^Ai ) ^A 2 ) i n both cases, and A12 
is deterministic when A\ and A2 are deterministic. This case is simpler also because for 
Limlnf-automata, deterministic and nondeterministic automata have the same expressive 
power. □ 

On the negative side, the (deterministic or not) limit-average and discounted-sum au- 
tomata are not closed under min. 

Theorem 4.10. The (non) deterministic LimAvg- automata are not closed under min. 

Proof. Consider the alphabet S = {a, b}, and consider the languages L a and L^ that assign 

the long-run average number of a's and b's, respectively. Note that there exist DLavg for 

the languages L a and L^. 

We show that there is no NLavg for the language L m = mm{L a ,Lt,}. To obtain 

a contradiction, assume that there exists a NLavg A for L m . We first claim that there 

must be either an a-cycle or a 6-cycle C that is reachable in A such that the sum of the 

weights in C is positive. Otherwise, if for all a-cycles and 6-cycles we have that the sum 

of the weights is zero or negative, then we fool the automaton as follows. Let (5 be the 

maximum of the absolute values of the weights in A, and let a = [/3] . Then consider the 

word w = (a 5 ' a ''^' • & 5 ' a 'IQI) w . F or a run r oi A over w, the long-run average of the weights 

is bounded as follows: 

4-/3-|Q| < 2 



10 -a- \Q\ ~ 5 

The above bound is as follows: in the run over a 5 '"'^', there can be a prefix of size at most 
\Q\ with sum of weights at most \Q\ -j5, and then there would be a-cycles, and then a trailing 
prefix of size at most \Q\ with sum of weights at most \Q\ ■ (3. Similar argument holds for the 
segment of b 5 ' a '<®<. Hence La{w) < |, however, L m (w) = 5, i.e., we have a contradiction. 
W.l.o.g., we assume that there is an a-cycle C such that the sum of weights of C is positive. 
Then we present the following word w: a finite word wc to reach the cycle C, followed by 



18 K. CHATTERJEE, L. DOYEN, AND T. A. HENZINGER 



a w ; the answer of the automaton is positive, i.e., La{w) > 0, while L m (w) = 0. Hence the 
result follows. □ 

Finally, we show that discounted-sum automata are not closed under min. 

Theorem 4.11. The (non) deterministic Disc-automata are not closed under min. 

Proof. Let A be a non-algebraic number in J^jlf- We consider the quantitative languages 
L^ and L^ that assign the A-discounted sum of a's and o's, respectively. Formally, given a 
(finite or infinite) word w = wqWi ■ ■ ■ £ £* U S w , let 

M M 

v a (w) = ^ A 4 and Vb(w) = ^ A* 

i\uii=a i\vii=b 

be the A-discounted sum of the a's (resp. 6's) of w. Then, L^(w) = v a (w) and L^(w) = 
Vb(w). These languages are definable by DDisc. We show that the language L m = 
min(L^,L^) is not definable by a NDisc. 

Assume towards contradiction that there is a NDisc A for L m . By Lemmas 2 and 3 
in [CDH08J . there exists an infinite word w such that v a (w) = Vb(w). 

Since v a (w) + Vb{w) = j^, we have L m (w) = 2 ti-x) and this is the maximal value of 
a word in L m (-). 

The maximal value in the automaton A can be obtained for a lasso-word of the form 
W\.{w2) u (where 101,102 are finite words and W2 is nonempty), as pure memoryless strate- 
gies exist in games over finite graphs with the objective to maximize the discounted sum 
of payoffs. Since the language of A is L m , the value of wi.(w2) w is 2 (i-a) ' anc ^ * nus 
v a (wi.(w2) u ) = Vb(wi.(w2) UJ ) by a similar argument as above. This last condition can 
be written as 

m , A^-a a (A) X ni -q b (X) 

p»M + 1 _ xn2 = PbW + x _ xn2 

for some polynomials p a ,Pb,Qa,Qb and integers n\ > and n^ > 0, or more simply as 

(l-A n2 )-p(A) + A ni -g(A) = (4.1) 

for some polynomials p of degree ri\ — 1 and q of degree ni — 1, all of whose coefficients are 
either 1 or — 1. Equation (|4.ip is not identically zero as either (1) n\ = and it reduces to 
q(X) = or (ii) n\ > and then p has degree at least so that the term of degree zero is 
not null in (|4.1|) . 

Therefore, A must be algebraic, a contradiction. □ 

4.4. Closure under complement for infinite words. Most of the weighted automata 
are not closed under complement. The next result is a direct extension of the boolean case. 

Theorem 4.12. The (non) deterministic Sup- and Limlnf-automata, and the deterministic 
LimSup-automata are not closed under complement. 

Proof. The result follows from a similar result for the boolean version of these classes. For 
DSup and NSup, consider the language L\ over E = {a, 0} such that Li(a u ) = and 
L\(w) = 1 for all w 7^ a u . For DLinf and NLinf, consider the language L2 over S = {a, 6} 
such that L2(S*.a w ) = 1 and L(w) = for all words w containing infinitely many 6's, and 
for DLSUP, consider L3 the complement of Li- O 
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Figure 3: Deterministic Limit- average Automaton. 

The next theorem is a positive result of closure under complementation for NLsup. It 
reduces to the complementation of nondeterministic Biichi automata. 

Theorem 4.13. The nondeterministic LimSup- automata are closed under complement, with 
costO{m-2 nl °z n ). 

Proof. Let A = (Q, q , £, S, 7) be a NLsup, and let V = {7(e) | e G 5} be the set of weights 
that appear in A. For each v G V, it is easy to construct a NBW A v whose (boolean) 
language is the set of words w such that La(w) > v, by declaring to be accepting the edges 
with weight at least v. We then construct for each v G V a NBW A v (with accepting 
edges) that accepts the (boolean) complement of the language accepted by A v . Finally, 
assuming that V = {vi, . . . ,v n } with v\ < V2 < ■ ■ ■ < v n , we construct the NLsup Bi for 
i = 2, ... ,n where Bi is obtained from A Vi by assigning weight 1 — Vi-\ to each accepting 
edges, and 1 — v n to all other edges. The complement of La is then max{L^ 2 , . . . , Ls n } 
which is accepted by a NLsup by Theorem 14.51 □ 

Theorem 4.14. The deterministic Disc-automata are closed under complement, with cost 
0(n). 

sketch. It suffices to replace each weight v of a DDisc by 1 — A — v (where A is the discount 
factor) to obtain the DDisc for the complement. □ 

Theorem 4.15. The deterministic LimAvg- automata are not closed under complement. 

Proof. Consider the DLavg A over alphabet £ = {a, b} (shown in Figure [3|) that consists 
of a single self-loop state with weight 1 for a and for b. Notice that LA{w.a u ) = 1 and 
LiAiw.U* 1 ) = for all w G E*. To obtain a contradiction, assume that there exists a DLavg 
B whose language is Lb = 1 — La- For all finite words w G £*, let Lg g (w) be the average 
weight of the unique (finite) run of B over w. 

Fix < e < g- For all finite words w, there exists a number n w such that the average 

number of a's in w.b Uw is at most e, and there exists a number m w such that L B vg (w.a mw ) < e 
(since Lsiw.a^) = 0). Hence, we can construct a word w = b ni a mi b n2 a rn2 ... such that 
La(w) < e and Lb{w) < e. Since Lb = 1 — La, this implies that 1 < 2e, a contradiction. □ 

Theorem 4.16. The nondeterministic LimAvg- and Disc-automata are not closed under 
complement. 

Proof. The fact that NLavg are not closed under complementation is as follows. Consider 
the quantitative language L* = 1— max{L a , L&} where L a and Lj, assign the long-run average 
number of a's and 6's, respectively. Exactly the same argument as in the proof of Theo- 
rem HTTU] shows that L* cannot be expressed as a NLavg, while the language max{L a ,L{,} 
can be expressed as NLavg by Theorem 14.51 



20 K. CHATTERJEE, L. DOYEN, AND T. A. HENZINGER 



That NDisc are not closed under complement can be obtained as follows: given < A < 
1, consider the language L^ and L^ that assigns to words the A-discounted sum of a's and 
6's, respectively. The language L\ and L^ can be expressed as DDisc, and the max of them 
can be defined by NDisc. Observe that L*(w) + L^(w) = j^j for all w G S w . Therefore, 
min{L^,L^} = ^—^ — max{L^,L^}. Since NDisc is not closed under min (Theorem 14. lip , 
we immediately obtain that NDisc are not closed under complementation. d 

4.5. Closure under sum for infinite words. All weighted automata are closed under 
sum, except DLavg and NLavg. 

Theorem 4.17. The (non) deterministic Sup-automata are closed under sum, with cost 
0{n\ ■ m\ ■ 712 • m-i). 

Proof. The construction is the same as for Max-automata over finite words given in the 
proof of Theorem 14.41 □ 

Theorem 4.18. The nondeterministic LimSup- automata are closed under sum, with cost 
0(ni ■ mi • ni • m.2). 

Sketch. Given two NLsup A\ and A2, we construct a NLsup A for the sum of their lan- 
guages as follows. Initially, we make a guess of a pair (1^1,^2) of weights {v i in A{, for i = 1,2) 
and we branch to a copy of the synchronized product of A± and A<i- We attach a bit b whose 
range is {1, 2} to each state to remember that we expect A), to visit the guessed weight v^. 
Whenever this occurs, the bit b is set to 3 — b, and the weight of the transition is i>i + U2- All 
other transitions (i.e. when b is unchanged) have weight minj^i + v-i \ V\ G V\ A Vi G V2}. Q 

Theorem 4.19. The deterministic LimSup- automata are closed under sum, with cost 0(n\ ■ 

n r 2 mi ™ a ). 

Proof. Let A\ = (Qi,qj, £,<5i,7i) and A2 = (Q2,qj,^,^2,l2) be two DLsup. We construct 
a DLsup A = (Q, gj, E, <5, 7) such that La = La 1 + La 2 - Let V, = {71(e) | e G 5i} be the set 
of weights that appear in Ai (for i = 1,2). The automaton A implements the synchronized 
product of A\ and A2, and keeps one bit b(v\,V2) for each pair (^1,^2) of weights v\ G V\ 
and V2 G V<j. For i = 1,2, if 6(^1,^2) = i, then A4 is expected to cross a transition with 
weight Vi. Whenever this occurs, the bit is set to 3 — i. The weight of a transition in A 
is the largest value of v\ + V2 such that the corresponding bit 6(^1,^2) has changed in the 
transition. Formally, we define: 
. Q = Q lX Q 2 x \yixV 2 ->{l,2}]i 

• 11 = {qjiqjibi) where 6/(^1,^2) = 1 for all (t>i,t>2) G V\ X V 2 ; 

• For each a G E, the set 5 contains all the triples {{qi,q2-,b),a,(q' l ,q' 2 ,b')) such that 
{qi-,°~-,q'i) £ o~i (i = 1,2), and for all (ui,u 2 ) G Vi X V 2 , we have b'(vi,V2) = 3 - 6(^1,^2) if 
li({qi-,o~-,q'i)) = Vi for i = b(vi,v 2 ), and otherwise b'(v\,V2) = b(vi,v 2 ). 

• 7 is defined by j({qi, q2, b),a, {q[, q' 2 ,b')) = max({i> min U {vi + v 2 | b'(vi,v 2 ) / 6(^1,^2)}) 
where v m - m is the minimal weight in V\ + Vjj = {vi + V2 \ V\ G V\ A v 2 G V2}. D 
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Theorem 4.20. The (non) deterministic Limlnf -automata are closed under sum with cost 
0(m • n 2 • 2 mi ' m2 ). 

Proof. Let A\ = (Qi, g), E,£i,7i) and A2 = (Q2, <zf>E,52, 72) be two NLinf. We construct 
a NLinf A = (Q, gj, E, 5, 7) such that La = La 1 + La 2 - Let V^ = {74(e) \ e £ Si} be the set 
of weights that appear in Ai (for i = 1,2). The automaton A implements the synchronized 
product of Ai and A2, and keeps one bit b(vx,v 2 ) for each pair (^1,^2) of weights ui G Vi 
and t>2 G V2. If a transition in Ai for some i G {1,2} has weight less than Vi, then the bit 
b(vi,V2) is set to _L, otherwise is set to T. The weight of a transition in A is the largest 
value of v\ + v 2 such that the corresponding bit b(v\,v 2 ) is T. Formally, we define: 
. Q = Q 1 xQ 2 x[V 1 xV 2 ->{T,±}]; 

• Ql = (q},qj,bi) where bj(vi,v 2 ) = -L for all (vi,v 2 ) G V\ X V2; 

• For each a G E, the set (5 contains all the triples ((qi,q2,b),o-, (q'i,q' 2 ,b')) such that 
(gi, a, <j£) G <5j (i = 1,2), and for all {v\,v 2 ) G V\ X V2, we have y(vi,v 2 ) = T if 
jidqijCr, q!j)) > Vi for i = 1,2, and otherwise &'(t>i,t>2) = JL. 

• 7 is defined by ~/((qi,q 2 ,b),cr, (q^q'^b 1 )) = max({i> min U {vi + v 2 \ b f (vi,v 2 ) = T}) where 
v m in is the minimal weight in V\ + V 2 = {v\ + v 2 \v\ G V\ A v 2 G V 2 }. 

The result for DLinf follows from the fact A is deterministic if A\ and A 2 are deterministic. 

D 
Theorem 4.21. 27ie (non) deterministic Disc-automata are closed under sum, with cost 
0(m -n 2 ). 

Sketch. It is easy to see that the synchronized product of two NDisc (resp. DDisc) defines 
the sum of their languages, if the weight of a joint transition is defined as the sum of the 
weights of the corresponding transitions in the two NDlSC (resp. DDisc). |_| 

Theorem 4.22. The (non) deterministic LimAvg- automata are not closed under sum. 

Proof. Consider the alphabet E = {a, b}, and consider the DLAVG-definable languages 
L a and Lb that assigns to each word w the long-run average number of a's and 6's in w 
respectively. Let L + = L a + Lt,. Assume that L + is defined by a NLavg A with set of 
states Q (we assume w.l.o.g that every state in Q is reachable). 

First, we claim that from every state q G Q, there is a run of A over a'**' that visit a 
cycle C* with average weight 1. To see this, notice that from every state q G Q, there is 
an infinite run p of A over a u whose value is 1 (since L + (w q ■ a u ) = 1 for all finite words 
w q ). Consider the following decomposition of p. Starting with an empty stack, we push the 
states of p onto the stack as soon as all the states on the stack are different. If the next 
state is already on the stack, we pop all the states down to the repeated state thus removing 
a simple cycle of p. Let C\, C 2 , . . . be the cycles that are successively removed. Observe 
that the height of the stack is always at most \Q\. Let j3 be the largest average weight of 
the cycles Cj, i > 1, and let a max be the largest weight in A. Assume towards contradiction 
that j3 < 1. Then, for all n > 0, the value of the prefix of length n of p is at most: 

cw|Q|+/3-Efeil^i 

n 

where k n is the number of cycles that have been removed from the stack when reading the 

first n symbols of p. Hence, the value of p is at most /3 < 1, which is a contradiction. 

Therefore, the average weight of some cycle C* = Ci is exactljo 1 (there are finitely many 



It cannot be greater than 1 since L+(w ■ a w ) = 1 for all finite words w. 
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different cycles as they are simple cycles). Since the height of the stack is at most \Q\, the 
cycle C* is reachable in at most \Q\ steps. 

Second, it can be shown analogously that from every state q € Q, there is a run over 
&'**' that visit a cycle C* with average weight 1. 

Third, for arbitrarily small e > 0, consider the word w and the run p of A over w 
generated inductively by the following procedure: wo is the empty word and po is the initial 
state of A We generate Wi + \ and pi + \ from wi and pi as follows: {%) generate a long enough 
sequence w' i+1 of a's after Wi such that the average number of 6's in wt ■ w' i+1 falls below 
e and we can continue pi and reach within at most \Q\ steps (and then repeat k times) a 
cycle C of average weight 1 and such that the average weight of this run prolonged by \Q\ 
arbitrary transitions is at least 1 — e, i.e. 

7(Pi) + fc-|C , | + 2a min -|Q| > 
\ Pi \ + k-\C\ + 2-\Q\ 

where a m i n is the least weight in A. This is possible since k can be chosen arbitrarily large. 
Let p\ be the prolongation of pi over w' i+1 ; (ii) then generate a long enough sequence w" +1 of 
o's such that the average number of a's in Wi • w' i+1 • w'( +1 falls below e and as above, we can 
construct a continuation p" of p\ whose average weight is at least 1 — e (even if prolonged 
by \Q\ arbitrary transitions); (Hi) the word tUj+i = u>« • w' i+l ■ w'( +1 and the run pi + \ is p'[. 
The word w and the run p are the limit of these sequences. We have L a (w) = L^w) = 
and thus L + (w) = 0, while the value of p is at least 1 — e, a contradiction. □ 
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